Correcting Serial Grammatical Errors based on N-grams and Syntax
نویسندگان
چکیده
In this paper, we present a new method based on machine translation for correcting serial grammatical errors in a given sentence in learners’ writing. In our approach, translation models are generated to translate the input into a grammatical sentence. The method involves automatically learning two translation models that are based on Web-scale n-grams. The first model translates trigrams containing serial preposition-verb errors into correct ones. The second model is a back-off model, used in the case where the trigram is not found in the training data. At run-time, the phrases in the input are matched and translated, and ranking is performed on all possible translations to produce a corrected sentence as output. Evaluation on a set of sentences in a learner corpus shows that the method corrects serial errors reasonably well. Our methodology exploits the state-of-the art in machine translation, resulting in an effective system that can deal with many error types at the same time.
منابع مشابه
A hybrid approach for correcting grammatical errors
This paper presents a hybrid approach for correcting grammatical errors in the sentences uttered by Korean learners of English. The error correction system plays an important role in GenieTutor, which is a dialogue-based English learning system designed to teach English to Korean students. During the talk with GenieTutor, grammatical error feedback and better expressions are offered to learners...
متن کاملGrammar Checker Features for Author Identification and Author Profiling Notebook for PAN at CLEF 2013
Our work on author identification and author profiling is based on the question: Can the number and the types of grammatical errors serve as indicators for a specific author or a group of people? In order to detect the grammatical errors we base our approach on the output of the open-source library LanguageTool. In the case of the author identification we transform the problem into a statistica...
متن کاملGrammatical Error Correction of English as Foreign Language Learners
This study aimed to discover the insight of error correction by implementing two correction systems on three Iranian university students. The three students were invited to write four in-class essays throughout the semester, in which their verb errors and individual-selected errors were corrected using the Code Correction System and the Individual Correction System. At the end of the study, the...
متن کاملCategory-based Statistical Language Models Synopsis
Language models are computational techniques and structures that describe word sequences produced by human subjects, and the work presented here considers primarily their application to automatic speech-recognition systems. Due to the very complex nature of natural languages as well as the need for robust recognition, statistically-based language models, which assign probabilities to word seque...
متن کاملIranian EFL Learners' Written Grammatical Errors: Different Levels of Language Proficiency
Errors are one of the enigmatic parts in the process of foreign language (L2) learning as they are extremely versatile at each and every stage of the language learning proficiency. The present study, therefore, was an attempt to reveal Iranian EFL learners’ grammatical errors in writing at two levels of proficiency, namely lower intermediate and advanced, and then to investigate whether there w...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IJCLCLP
دوره 18 شماره
صفحات -
تاریخ انتشار 2013